A high-dimension two-sample test for the mean using cluster subspaces
Related resources
On the High Dimensional Power of a Linear-Time Two Sample Test under Mean-shift Alternatives
Nonparametric two sample testing deals with the question of consistently deciding if two distributions are different, given samples from both, without making any parametric assumptions about the form of the distributions. The current literature is split into two kinds of tests: those which are consistent without any assumptions about how the distributions may differ (general alternatives), and t...
A multivariate two-sample mean test for small sample size and missing data.
We develop a new statistic for testing the equality of two multivariate mean vectors. A scaled chi-squared distribution is proposed as an approximating null distribution. Because the test statistic is based on componentwise statistics, it has the advantage over Hotelling's T² test of being applicable to the case where the dimension of an observation exceeds the number of observations. An appeal...
Approximate sample size formulas for the two-sample trimmed mean test with unequal variances.
Yuen's two-sample trimmed mean test statistic is one of the most robust methods to apply when variances are heterogeneous. The present study develops formulas for the sample size required for the test. The formulas are applicable for the cases of unequal variances, non-normality and unequal sample sizes. Given the specified alpha and the power (1-beta), the minimum sample size needed by the pro...
Estimating the First Selected PSU Mean in a Two Stage Cluster Sample- Expanding the PSUs
Scott and Smith (1969) develop estimators for linear functions from a two stage cluster sample, with their discussion repeated in many places. This discussion was reviewed in c01ed13.doc. It is noteworthy that Scott and Smith allow expressions for the variance to depend on the cluster. A similar development is given by Valliant et al. We repeated the derivation of Valliant et al. using the fin...
A Multivariate Two-Sample Test using the Jaccard Distance
A common need in statistics is to assess whether two samples come from the same underlying population distribution. Existing two-sample tests often make limiting a priori assumptions, or cannot be easily generalized to multivariate data. We derive a new multivariate two-sample test that makes no a priori assumptions, has higher statistical power than previous tests, has better runtime performan...
Journal
Journal title: Computational Statistics & Data Analysis
Year: 2016
ISSN: 0167-9473
DOI: 10.1016/j.csda.2015.12.004